Overview

Dataset statistics

Number of variables40
Number of observations30236
Missing cells0
Missing cells (%)0.0%
Duplicate rows27
Duplicate rows (%)0.1%
Total size in memory9.2 MiB
Average record size in memory320.0 B

Variable types

BOOL20
NUM17
CAT3

Reproduction

Analysis started2020-06-05 16:54:48.628481
Analysis finished2020-06-05 16:55:41.352401
Duration52.72 seconds
Versionpandas-profiling v2.8.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml

Warnings

Dataset has 27 (0.1%) duplicate rows Duplicates
Number_of_announced_NLRI_prefixes is highly correlated with Number_of_announcements and 1 other fieldsHigh correlation
Number_of_announcements is highly correlated with Number_of_announced_NLRI_prefixes and 2 other fieldsHigh correlation
Average_edit_distance is highly correlated with Maximum_AS-path_lengthHigh correlation
Maximum_AS-path_length is highly correlated with Average_edit_distanceHigh correlation
Maximum_AS-path_length_10 is highly correlated with Maximum_edit_distance_7High correlation
Maximum_edit_distance_7 is highly correlated with Maximum_AS-path_length_10High correlation
Maximum_AS-path_length_11 is highly correlated with Maximum_edit_distance_8High correlation
Maximum_edit_distance_8 is highly correlated with Maximum_AS-path_length_11High correlation
Maximum_AS-path_length_12 is highly correlated with Maximum_edit_distance_9High correlation
Maximum_edit_distance_9 is highly correlated with Maximum_AS-path_length_12High correlation
Maximum_AS-path_length_13 is highly correlated with Maximum_edit_distance_10High correlation
Maximum_edit_distance_10 is highly correlated with Maximum_AS-path_length_13High correlation
Maximum_AS-path_length_14 is highly correlated with Maximum_edit_distance_11High correlation
Maximum_edit_distance_11 is highly correlated with Maximum_AS-path_length_14High correlation
Maximum_AS-path_length_15 is highly correlated with Maximum_edit_distance_12High correlation
Maximum_edit_distance_12 is highly correlated with Maximum_AS-path_length_15High correlation
Number_of_Interior_Gateway_Protocol_IGP_packets is highly correlated with Number_of_announcements and 1 other fieldsHigh correlation
Number_of_incomplete_packets is highly correlated with Number_of_announcementsHigh correlation
type is highly correlated with AnomalousHigh correlation
Anomalous is highly correlated with typeHigh correlation
Number_of_announcements is highly skewed (γ1 = 37.09375492) Skewed
Number_of_withdrawals is highly skewed (γ1 = 55.31501153) Skewed
Number_of_announced_NLRI_prefixes is highly skewed (γ1 = 29.32843764) Skewed
Number_of_withdrawn_NLRI_prefixes is highly skewed (γ1 = 76.1167729) Skewed
Number_of_implicit_withdrawals is highly skewed (γ1 = 21.9828425) Skewed
Number_of_Interior_Gateway_Protocol_IGP_packets is highly skewed (γ1 = 35.34201326) Skewed
Number_of_Exterior_Gateway_Protocol_EGP_packets is highly skewed (γ1 = 42.67050751) Skewed
Number_of_incomplete_packets is highly skewed (γ1 = 37.74118142) Skewed
Number_of_duplicate_announcements has 3191 (10.6%) zeros Zeros
Number_of_implicit_withdrawals has 4576 (15.1%) zeros Zeros
Number_of_Exterior_Gateway_Protocol_EGP_packets has 27309 (90.3%) zeros Zeros
Number_of_incomplete_packets has 2446 (8.1%) zeros Zeros

Variables

Number_of_announcements
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED

Distinct count939
Unique (%)3.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean131.802950125678
Minimum5
Maximum24122
Zeros0
Zeros (%)0.0%
Memory size236.2 KiB

Quantile statistics

Minimum5
5-th percentile30
Q152
median81
Q3143
95-th percentile309
Maximum24122
Range24117
Interquartile range (IQR)91

Descriptive statistics

Standard deviation382.2452394
Coefficient of variation (CV)2.900126583
Kurtosis1874.825962
Mean131.8029501
Median Absolute Deviation (MAD)36
Skewness37.09375492
Sum3985194
Variance146111.4231
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
503601.2%
 
493341.1%
 
583311.1%
 
523281.1%
 
593221.1%
 
463221.1%
 
573181.1%
 
543171.0%
 
423171.0%
 
453141.0%
 
Other values (929)2697389.2%
 
ValueCountFrequency (%) 
51< 0.1%
 
61< 0.1%
 
81< 0.1%
 
94< 0.1%
 
107< 0.1%
 
ValueCountFrequency (%) 
241221< 0.1%
 
236561< 0.1%
 
222611< 0.1%
 
193361< 0.1%
 
149961< 0.1%
 

Number_of_withdrawals
Real number (ℝ≥0)

SKEWED

Distinct count44
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.3932067733827225
Minimum0
Maximum675
Zeros26
Zeros (%)0.1%
Memory size236.2 KiB

Quantile statistics

Minimum0
5-th percentile3
Q15
median6
Q38
95-th percentile15
Maximum675
Range675
Interquartile range (IQR)3

Descriptive statistics

Standard deviation7.74733755
Coefficient of variation (CV)1.047899482
Kurtosis4201.011117
Mean7.393206773
Median Absolute Deviation (MAD)2
Skewness55.31501153
Sum223541
Variance60.02123912
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
6590619.5%
 
5495516.4%
 
7414113.7%
 
4326010.8%
 
823527.8%
 
315095.0%
 
911203.7%
 
1110043.3%
 
129753.2%
 
108902.9%
 
Other values (34)412413.6%
 
ValueCountFrequency (%) 
0260.1%
 
11370.5%
 
25331.8%
 
315095.0%
 
4326010.8%
 
ValueCountFrequency (%) 
6751< 0.1%
 
6471< 0.1%
 
5101< 0.1%
 
3511< 0.1%
 
3291< 0.1%
 

Number_of_announced_NLRI_prefixes
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED

Distinct count2259
Unique (%)7.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean411.5626405609208
Minimum6
Maximum78495
Zeros0
Zeros (%)0.0%
Memory size236.2 KiB

Quantile statistics

Minimum6
5-th percentile52
Q1111
median214
Q3414
95-th percentile1155.25
Maximum78495
Range78489
Interquartile range (IQR)303

Descriptive statistics

Standard deviation1272.639343
Coefficient of variation (CV)3.092212989
Kurtosis1275.875448
Mean411.5626406
Median Absolute Deviation (MAD)123
Skewness29.32843764
Sum12444008
Variance1619610.897
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1001320.4%
 
881200.4%
 
1081190.4%
 
741180.4%
 
861150.4%
 
991150.4%
 
951130.4%
 
1061130.4%
 
601130.4%
 
801120.4%
 
Other values (2249)2906696.1%
 
ValueCountFrequency (%) 
61< 0.1%
 
111< 0.1%
 
121< 0.1%
 
134< 0.1%
 
143< 0.1%
 
ValueCountFrequency (%) 
784951< 0.1%
 
666211< 0.1%
 
632831< 0.1%
 
561651< 0.1%
 
418131< 0.1%
 

Number_of_withdrawn_NLRI_prefixes
Real number (ℝ≥0)

SKEWED

Distinct count602
Unique (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean64.27401111258104
Minimum0
Maximum82739
Zeros26
Zeros (%)0.1%
Memory size236.2 KiB

Quantile statistics

Minimum0
5-th percentile7
Q117
median30
Q354
95-th percentile152
Maximum82739
Range82739
Interquartile range (IQR)37

Descriptive statistics

Standard deviation861.4705358
Coefficient of variation (CV)13.4030928
Kurtosis6392.129618
Mean64.27401111
Median Absolute Deviation (MAD)16
Skewness76.1167729
Sum1943389
Variance742131.484
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
126872.3%
 
196632.2%
 
176602.2%
 
156572.2%
 
186452.1%
 
146412.1%
 
116402.1%
 
136382.1%
 
206362.1%
 
246322.1%
 
Other values (592)2373778.5%
 
ValueCountFrequency (%) 
0260.1%
 
1550.2%
 
21220.4%
 
31700.6%
 
42250.7%
 
ValueCountFrequency (%) 
827391< 0.1%
 
785871< 0.1%
 
622341< 0.1%
 
421301< 0.1%
 
405801< 0.1%
 

Average_AS-path_length
Real number (ℝ≥0)

Distinct count7
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.171385103849715
Minimum4
Maximum10
Zeros0
Zeros (%)0.0%
Memory size236.2 KiB

Quantile statistics

Minimum4
5-th percentile5
Q16
median6
Q36
95-th percentile7
Maximum10
Range6
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.6176362435
Coefficient of variation (CV)0.1000806518
Kurtosis1.581695649
Mean6.171385104
Median Absolute Deviation (MAD)0
Skewness0.5895580897
Sum186598
Variance0.3814745292
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
62009266.5%
 
7668022.1%
 
528189.3%
 
85831.9%
 
9440.1%
 
1010< 0.1%
 
49< 0.1%
 
ValueCountFrequency (%) 
49< 0.1%
 
528189.3%
 
62009266.5%
 
7668022.1%
 
85831.9%
 
ValueCountFrequency (%) 
1010< 0.1%
 
9440.1%
 
85831.9%
 
7668022.1%
 
62009266.5%
 

Maximum_AS-path_length
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count34
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.117839661330864
Minimum5
Maximum107
Zeros0
Zeros (%)0.0%
Memory size236.2 KiB

Quantile statistics

Minimum5
5-th percentile8
Q111
median13
Q315
95-th percentile19
Maximum107
Range102
Interquartile range (IQR)4

Descriptive statistics

Standard deviation3.577492145
Coefficient of variation (CV)0.2727196122
Kurtosis102.412865
Mean13.11783966
Median Absolute Deviation (MAD)2
Skewness4.586089785
Sum396631
Variance12.79845005
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
12402813.3%
 
11370512.3%
 
13343511.4%
 
14342411.3%
 
1028819.5%
 
1528669.5%
 
1620986.9%
 
919726.5%
 
1716475.4%
 
811033.6%
 
Other values (24)307710.2%
 
ValueCountFrequency (%) 
55< 0.1%
 
6470.2%
 
75221.7%
 
811033.6%
 
919726.5%
 
ValueCountFrequency (%) 
1071< 0.1%
 
1061< 0.1%
 
1054< 0.1%
 
1041< 0.1%
 
411< 0.1%
 

Average_unique_AS-path_length
Real number (ℝ≥0)

Distinct count8
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.182365392247652
Minimum4
Maximum11
Zeros0
Zeros (%)0.0%
Memory size236.2 KiB

Quantile statistics

Minimum4
5-th percentile5
Q16
median6
Q37
95-th percentile7
Maximum11
Range7
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.6172761461
Coefficient of variation (CV)0.09984465603
Kurtosis1.371464145
Mean6.182365392
Median Absolute Deviation (MAD)0
Skewness0.5357796443
Sum186930
Variance0.3810298405
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
61986265.7%
 
7700623.2%
 
527519.1%
 
85591.8%
 
9440.1%
 
48< 0.1%
 
105< 0.1%
 
111< 0.1%
 
ValueCountFrequency (%) 
48< 0.1%
 
527519.1%
 
61986265.7%
 
7700623.2%
 
85591.8%
 
ValueCountFrequency (%) 
111< 0.1%
 
105< 0.1%
 
9440.1%
 
85591.8%
 
7700623.2%
 

Number_of_duplicate_announcements
Real number (ℝ≥0)

ZEROS

Distinct count725
Unique (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean34.776160867839664
Minimum0
Maximum5516
Zeros3191
Zeros (%)10.6%
Memory size236.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q12
median6
Q320
95-th percentile185
Maximum5516
Range5516
Interquartile range (IQR)18

Descriptive statistics

Standard deviation119.8170179
Coefficient of variation (CV)3.445377952
Kurtosis378.5804252
Mean34.77616087
Median Absolute Deviation (MAD)5
Skewness13.39534628
Sum1051492
Variance14356.11778
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1321810.6%
 
0319110.6%
 
227339.0%
 
322347.4%
 
418336.1%
 
514574.8%
 
611823.9%
 
710363.4%
 
88162.7%
 
97412.5%
 
Other values (715)1179539.0%
 
ValueCountFrequency (%) 
0319110.6%
 
1321810.6%
 
227339.0%
 
322347.4%
 
418336.1%
 
ValueCountFrequency (%) 
55161< 0.1%
 
50081< 0.1%
 
44451< 0.1%
 
27691< 0.1%
 
27241< 0.1%
 

Number_of_duplicate_withdrawals
Real number (ℝ≥0)

Distinct count2094
Unique (%)6.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean301.9616351369229
Minimum0
Maximum27624
Zeros8
Zeros (%)< 0.1%
Memory size236.2 KiB

Quantile statistics

Minimum0
5-th percentile21
Q158
median132
Q3324
95-th percentile1094
Maximum27624
Range27624
Interquartile range (IQR)266

Descriptive statistics

Standard deviation597.0329858
Coefficient of variation (CV)1.97718159
Kurtosis375.0461343
Mean301.9616351
Median Absolute Deviation (MAD)92
Skewness12.84693036
Sum9130112
Variance356448.3862
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
351890.6%
 
401880.6%
 
281800.6%
 
441800.6%
 
371770.6%
 
341770.6%
 
551760.6%
 
391740.6%
 
501730.6%
 
361730.6%
 
Other values (2084)2844994.1%
 
ValueCountFrequency (%) 
08< 0.1%
 
1160.1%
 
2220.1%
 
314< 0.1%
 
4260.1%
 
ValueCountFrequency (%) 
276241< 0.1%
 
256081< 0.1%
 
215721< 0.1%
 
144491< 0.1%
 
135441< 0.1%
 

Number_of_implicit_withdrawals
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct count404
Unique (%)1.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20.0839066014023
Minimum0
Maximum3688
Zeros4576
Zeros (%)15.1%
Memory size236.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median5
Q319
95-th percentile78
Maximum3688
Range3688
Interquartile range (IQR)18

Descriptive statistics

Standard deviation64.78471609
Coefficient of variation (CV)3.225702916
Kurtosis854.6452715
Mean20.0839066
Median Absolute Deviation (MAD)5
Skewness21.9828425
Sum607257
Variance4197.059439
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0457615.1%
 
1361912.0%
 
223227.7%
 
321337.1%
 
415725.2%
 
512284.1%
 
610903.6%
 
79253.1%
 
86942.3%
 
96042.0%
 
Other values (394)1147337.9%
 
ValueCountFrequency (%) 
0457615.1%
 
1361912.0%
 
223227.7%
 
321337.1%
 
415725.2%
 
ValueCountFrequency (%) 
36881< 0.1%
 
33381< 0.1%
 
26771< 0.1%
 
22011< 0.1%
 
19961< 0.1%
 

Average_edit_distance
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count33
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.848524937161
Minimum5
Maximum107
Zeros0
Zeros (%)0.0%
Memory size236.2 KiB

Quantile statistics

Minimum5
5-th percentile7
Q111
median13
Q315
95-th percentile19
Maximum107
Range102
Interquartile range (IQR)4

Descriptive statistics

Standard deviation3.748123368
Coefficient of variation (CV)0.2917162387
Kurtosis85.80269489
Mean12.84852494
Median Absolute Deviation (MAD)2
Skewness3.927857268
Sum388488
Variance14.04842878
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
12389712.9%
 
11349011.5%
 
13336711.1%
 
14325210.8%
 
1527689.2%
 
1027109.0%
 
1620376.7%
 
919196.3%
 
1715755.2%
 
811883.9%
 
Other values (23)403313.3%
 
ValueCountFrequency (%) 
5930.3%
 
66432.1%
 
79203.0%
 
811883.9%
 
919196.3%
 
ValueCountFrequency (%) 
1071< 0.1%
 
1061< 0.1%
 
1054< 0.1%
 
1041< 0.1%
 
346< 0.1%
 

Maximum_edit_distance
Real number (ℝ≥0)

Distinct count184
Unique (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.67575076068263
Minimum0.1
Maximum900.0
Zeros0
Zeros (%)0.0%
Memory size236.2 KiB

Quantile statistics

Minimum0.1
5-th percentile0.5
Q10.9
median1.5
Q32.8
95-th percentile7.5
Maximum900
Range899.9
Interquartile range (IQR)1.9

Descriptive statistics

Standard deviation33.07775173
Coefficient of variation (CV)5.82790773
Kurtosis248.458843
Mean5.675750761
Median Absolute Deviation (MAD)0.7
Skewness13.99376123
Sum171612
Variance1094.13766
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0.918776.2%
 
0.818146.0%
 
0.716625.5%
 
1.116025.3%
 
114754.9%
 
1.213844.6%
 
0.613324.4%
 
1.311373.8%
 
1.410203.4%
 
0.59763.2%
 
Other values (174)1595752.8%
 
ValueCountFrequency (%) 
0.15< 0.1%
 
0.2700.2%
 
0.31870.6%
 
0.45221.7%
 
0.59763.2%
 
ValueCountFrequency (%) 
9004< 0.1%
 
8005< 0.1%
 
7003< 0.1%
 
60012< 0.1%
 
500160.1%
 

Inter-arrival_time
Real number (ℝ≥0)

Distinct count12
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.233959518454822
Minimum2
Maximum15
Zeros0
Zeros (%)0.0%
Memory size236.2 KiB

Quantile statistics

Minimum2
5-th percentile4
Q16
median6
Q37
95-th percentile8
Maximum15
Range13
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.215507679
Coefficient of variation (CV)0.1949816445
Kurtosis3.950632456
Mean6.233959518
Median Absolute Deviation (MAD)1
Skewness-1.437071354
Sum188490
Variance1.477458917
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
61310243.3%
 
71044334.5%
 
525888.6%
 
821027.0%
 
210903.6%
 
33751.2%
 
92630.9%
 
42320.8%
 
10350.1%
 
114< 0.1%
 
Other values (2)2< 0.1%
 
ValueCountFrequency (%) 
210903.6%
 
33751.2%
 
42320.8%
 
525888.6%
 
61310243.3%
 
ValueCountFrequency (%) 
151< 0.1%
 
121< 0.1%
 
114< 0.1%
 
10350.1%
 
92630.9%
 

Maximum_edit_distance_7
Boolean

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
26531
1
 
3705
ValueCountFrequency (%) 
02653187.7%
 
1370512.3%
 

Maximum_edit_distance_8
Boolean

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
26208
1
 
4028
ValueCountFrequency (%) 
02620886.7%
 
1402813.3%
 

Maximum_edit_distance_9
Boolean

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
26801
1
 
3435
ValueCountFrequency (%) 
02680188.6%
 
1343511.4%
 

Maximum_edit_distance_10
Boolean

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
26812
1
 
3424
ValueCountFrequency (%) 
02681288.7%
 
1342411.3%
 

Maximum_edit_distance_11
Boolean

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
27379
1
 
2857
ValueCountFrequency (%) 
02737990.6%
 
128579.4%
 

Maximum_edit_distance_12
Boolean

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
28138
1
 
2098
ValueCountFrequency (%) 
02813893.1%
 
120986.9%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
28589
1
 
1647
ValueCountFrequency (%) 
02858994.6%
 
116475.4%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
29574
1
 
662
ValueCountFrequency (%) 
02957497.8%
 
16622.2%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
29632
1
 
604
ValueCountFrequency (%) 
02963298.0%
 
16042.0%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
29851
1
 
385
ValueCountFrequency (%) 
02985198.7%
 
13851.3%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
29316
1
 
920
ValueCountFrequency (%) 
02931697.0%
 
19203.0%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
29054
1
 
1182
ValueCountFrequency (%) 
02905496.1%
 
111823.9%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
28333
1
 
1903
ValueCountFrequency (%) 
02833393.7%
 
119036.3%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
27526
1
 
2710
ValueCountFrequency (%) 
02752691.0%
 
127109.0%
 

Maximum_AS-path_length_10
Boolean

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
26746
1
 
3490
ValueCountFrequency (%) 
02674688.5%
 
1349011.5%
 

Maximum_AS-path_length_11
Boolean

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
26339
1
 
3897
ValueCountFrequency (%) 
02633987.1%
 
1389712.9%
 

Maximum_AS-path_length_12
Boolean

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
26869
1
 
3367
ValueCountFrequency (%) 
02686988.9%
 
1336711.1%
 

Maximum_AS-path_length_13
Boolean

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
26984
1
 
3252
ValueCountFrequency (%) 
02698489.2%
 
1325210.8%
 

Maximum_AS-path_length_14
Boolean

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
27477
1
 
2759
ValueCountFrequency (%) 
02747790.9%
 
127599.1%
 

Maximum_AS-path_length_15
Boolean

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
0
28199
1
 
2037
ValueCountFrequency (%) 
02819993.3%
 
120376.7%
 

Number_of_Interior_Gateway_Protocol_IGP_packets
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED

Distinct count962
Unique (%)3.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean126.8407527450721
Minimum0
Maximum22285
Zeros31
Zeros (%)0.1%
Memory size236.2 KiB

Quantile statistics

Minimum0
5-th percentile26
Q147
median73
Q3137
95-th percentile324
Maximum22285
Range22285
Interquartile range (IQR)90

Descriptive statistics

Standard deviation360.2402315
Coefficient of variation (CV)2.840098499
Kurtosis1734.362969
Mean126.8407527
Median Absolute Deviation (MAD)35
Skewness35.34201326
Sum3835157
Variance129773.0244
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
503691.2%
 
433561.2%
 
483461.1%
 
423451.1%
 
523431.1%
 
453431.1%
 
513371.1%
 
583341.1%
 
463341.1%
 
573331.1%
 
Other values (952)2679688.6%
 
ValueCountFrequency (%) 
0310.1%
 
21< 0.1%
 
41< 0.1%
 
72< 0.1%
 
83< 0.1%
 
ValueCountFrequency (%) 
222851< 0.1%
 
218011< 0.1%
 
204541< 0.1%
 
178601< 0.1%
 
137971< 0.1%
 

Number_of_Exterior_Gateway_Protocol_EGP_packets
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct count47
Unique (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2974930546368567
Minimum0
Maximum218
Zeros27309
Zeros (%)90.3%
Memory size236.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2
Maximum218
Range218
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2.395954986
Coefficient of variation (CV)8.053818229
Kurtosis2913.915131
Mean0.2974930546
Median Absolute Deviation (MAD)0
Skewness42.67050751
Sum8995
Variance5.740600297
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
02730990.3%
 
111343.8%
 
28572.8%
 
33951.3%
 
42120.7%
 
5880.3%
 
6620.2%
 
7410.1%
 
8220.1%
 
9190.1%
 
Other values (37)970.3%
 
ValueCountFrequency (%) 
02730990.3%
 
111343.8%
 
28572.8%
 
33951.3%
 
42120.7%
 
ValueCountFrequency (%) 
2181< 0.1%
 
1271< 0.1%
 
1071< 0.1%
 
1041< 0.1%
 
791< 0.1%
 

Number_of_incomplete_packets
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
ZEROS

Distinct count196
Unique (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.917052520174627
Minimum0
Maximum1807
Zeros2446
Zeros (%)8.1%
Memory size236.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q14
median10
Q316
95-th percentile32
Maximum1807
Range1807
Interquartile range (IQR)12

Descriptive statistics

Standard deviation29.03666698
Coefficient of variation (CV)2.247932873
Kurtosis1966.313662
Mean12.91705252
Median Absolute Deviation (MAD)6
Skewness37.74118142
Sum390560
Variance843.1280294
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
024468.1%
 
314724.9%
 
614564.8%
 
514484.8%
 
714284.7%
 
414204.7%
 
214154.7%
 
813664.5%
 
1013214.4%
 
1112974.3%
 
Other values (186)1516750.2%
 
ValueCountFrequency (%) 
024468.1%
 
111433.8%
 
214154.7%
 
314724.9%
 
414204.7%
 
ValueCountFrequency (%) 
18071< 0.1%
 
17961< 0.1%
 
17771< 0.1%
 
14501< 0.1%
 
12101< 0.1%
 

Packet_size_bytes
Real number (ℝ≥0)

Distinct count1506
Unique (%)5.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean661.9944437094854
Minimum0
Maximum14975
Zeros31
Zeros (%)0.1%
Memory size236.2 KiB

Quantile statistics

Minimum0
5-th percentile221
Q1240
median265
Q3299
95-th percentile506.5
Maximum14975
Range14975
Interquartile range (IQR)59

Descriptive statistics

Standard deviation1761.450626
Coefficient of variation (CV)2.660823883
Kurtosis17.52735114
Mean661.9944437
Median Absolute Deviation (MAD)28
Skewness4.362501777
Sum20016064
Variance3102708.308
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
2413911.3%
 
2403841.3%
 
2393681.2%
 
2343671.2%
 
2433611.2%
 
2383571.2%
 
2493541.2%
 
2443511.2%
 
2363511.2%
 
2503501.2%
 
Other values (1496)2660288.0%
 
ValueCountFrequency (%) 
0310.1%
 
681< 0.1%
 
811< 0.1%
 
1171< 0.1%
 
1211< 0.1%
 
ValueCountFrequency (%) 
149751< 0.1%
 
142431< 0.1%
 
139711< 0.1%
 
139571< 0.1%
 
136971< 0.1%
 

Anomalous
Categorical

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
-1
25243
1
 
4993
ValueCountFrequency (%) 
-12524383.5%
 
1499316.5%
 

Length

Max length2
Median length2
Mean length1.834865723
Min length1

type
Categorical

HIGH CORRELATION

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
normal
25243
anomaly
 
4993
ValueCountFrequency (%) 
normal2524383.5%
 
anomaly499316.5%
 

Length

Max length7
Median length6
Mean length6.165134277
Min length6

category
Categorical

Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size236.2 KiB
Code_Red_I
7200
Nimda
7200
Slammer
7200
RIPE_regular
7196
BCNET_regular
 
1440
ValueCountFrequency (%) 
Code_Red_I720023.8%
 
Nimda720023.8%
 
Slammer720023.8%
 
RIPE_regular719623.8%
 
BCNET_regular14404.8%
 

Length

Max length13
Median length10
Mean length8.713851038
Min length5

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

Number_of_announcementsNumber_of_withdrawalsNumber_of_announced_NLRI_prefixesNumber_of_withdrawn_NLRI_prefixesAverage_AS-path_lengthMaximum_AS-path_lengthAverage_unique_AS-path_lengthNumber_of_duplicate_announcementsNumber_of_duplicate_withdrawalsNumber_of_implicit_withdrawalsAverage_edit_distanceMaximum_edit_distanceInter-arrival_timeMaximum_edit_distance_7Maximum_edit_distance_8Maximum_edit_distance_9Maximum_edit_distance_10Maximum_edit_distance_11Maximum_edit_distance_12Maximum_edit_distance_13Maximum_edit_distance_14Maximum_edit_distance_15Maximum_edit_distance_16Maximum_edit_distance_17Maximum_AS-path_length_7Maximum_AS-path_length_8Maximum_AS-path_length_9Maximum_AS-path_length_10Maximum_AS-path_length_11Maximum_AS-path_length_12Maximum_AS-path_length_13Maximum_AS-path_length_14Maximum_AS-path_length_15Number_of_Interior_Gateway_Protocol_IGP_packetsNumber_of_Exterior_Gateway_Protocol_EGP_packetsNumber_of_incomplete_packetsPacket_size_bytesAnomaloustypecategory
0578203166156206150206100.020000100000000000000060008079-1normalBCNET_regular
1622336175616639835512061.130000010000000000000068009515-1normalBCNET_regular
274123982361264333232871.230100000000100000000075008632-1normalBCNET_regular
37045434962765682107281.130000000000010000000072009227-1normalBCNET_regular
45143474585439263560.820000000000000000000052007831-1normalBCNET_regular
572164695361265393368661.230100000000000000000077008694-1normalBCNET_regular
66373742661055182382771.020000000000100000000065008040-1normalBCNET_regular
77715554188616669947358371.330000010000100000000079009310-1normalBCNET_regular
8729535119610663672118271.230000000000100000000074009180-1normalBCNET_regular
96192686451052931466861.020000000000000000000063007218-1normalBCNET_regular

Last rows

Number_of_announcementsNumber_of_withdrawalsNumber_of_announced_NLRI_prefixesNumber_of_withdrawn_NLRI_prefixesAverage_AS-path_lengthMaximum_AS-path_lengthAverage_unique_AS-path_lengthNumber_of_duplicate_announcementsNumber_of_duplicate_withdrawalsNumber_of_implicit_withdrawalsAverage_edit_distanceMaximum_edit_distanceInter-arrival_timeMaximum_edit_distance_7Maximum_edit_distance_8Maximum_edit_distance_9Maximum_edit_distance_10Maximum_edit_distance_11Maximum_edit_distance_12Maximum_edit_distance_13Maximum_edit_distance_14Maximum_edit_distance_15Maximum_edit_distance_16Maximum_edit_distance_17Maximum_AS-path_length_7Maximum_AS-path_length_8Maximum_AS-path_length_9Maximum_AS-path_length_10Maximum_AS-path_length_11Maximum_AS-path_length_12Maximum_AS-path_length_13Maximum_AS-path_length_14Maximum_AS-path_length_15Number_of_Interior_Gateway_Protocol_IGP_packetsNumber_of_Exterior_Gateway_Protocol_EGP_packetsNumber_of_incomplete_packetsPacket_size_bytesAnomaloustypecategory
30226281441686124080.46000000000001000000001909219-1normalSlammer
30227294931871270455120.56010000000000000100002801266-1normalSlammer
30228476761261161442110.86100000000000001000004205221-1normalSlammer
30229615137277197136810191.17000000001000000000005902248-1normalSlammer
3023051479661976641190.97000000001000000000005001220-1normalSlammer
302314459021618613582180.86000000010000000000003905243-1normalSlammer
3023221139161160130110.36100000000000001000001803222-1normalSlammer
3023317349661160110110.37100000000000001000001601241-1normalSlammer
3023427558761964191190.57000000001000000000002007223-1normalSlammer
30235247431561963122190.56000000001000000000002103218-1normalSlammer

Duplicate rows

Most frequent

Number_of_announcementsNumber_of_withdrawalsNumber_of_announced_NLRI_prefixesNumber_of_withdrawn_NLRI_prefixesAverage_AS-path_lengthMaximum_AS-path_lengthAverage_unique_AS-path_lengthNumber_of_duplicate_announcementsNumber_of_duplicate_withdrawalsNumber_of_implicit_withdrawalsAverage_edit_distanceMaximum_edit_distanceInter-arrival_timeMaximum_edit_distance_7Maximum_edit_distance_8Maximum_edit_distance_9Maximum_edit_distance_10Maximum_edit_distance_11Maximum_edit_distance_12Maximum_edit_distance_13Maximum_edit_distance_14Maximum_edit_distance_15Maximum_edit_distance_16Maximum_edit_distance_17Maximum_AS-path_length_7Maximum_AS-path_length_8Maximum_AS-path_length_9Maximum_AS-path_length_10Maximum_AS-path_length_11Maximum_AS-path_length_12Maximum_AS-path_length_13Maximum_AS-path_length_14Maximum_AS-path_length_15Number_of_Interior_Gateway_Protocol_IGP_packetsNumber_of_Exterior_Gateway_Protocol_EGP_packetsNumber_of_incomplete_packetsPacket_size_bytesAnomaloustypecategorycount
1142217469602090.260000000000000000000000001anomalyNimda9
31106246657157811915151.980000000000000000000000001anomalyNimda9
2862292176964134091.460000000000000000000000001anomalyNimda7
010222268601080.25000000000000000000000000-1normalNimda6